Source | # of sentences | Average logarithmic rank |
---|---|---|
http://rss.feedsportal.com/c/860/f/517321/s/c86f902/l/0Lar0Brian0Bru0Ceconomy0C20A10A0A80A40C1273682630Bhtml/story01.htm | 11 | 5.20 |
http://rss.feedsportal.com/c/860/f/517321/s/d2abff5/l/0Lar0Brian0Bru0Cpolicy0Cforeign0C20A10A0A8270C1275845830Bhtml/story01.htm | 12 | 5.35 |
http://www.elaph.com/Web/NewsPapers/2009/5/442391.htm | 86 | 5.39 |
http://www.bbc.co.uk/go/wsy/syn/rss/2.0/-/arabic/middleeast/2010/09/100928_israelprimeminister.shtml | 11 | 5.55 |
http://rss.feedsportal.com/c/860/f/517321/s/d41fcb4/l/0Lar0Brian0Bru0Cpolicy0Cforeign0C20A10A0A830A0C1276121370Bhtml/story01.htm | 17 | 5.56 |
http://rss.feedsportal.com/c/860/f/517321/s/cd77ca0/l/0Lar0Brian0Bru0Cpolicy0Cforeign0C20A10A0A8160C127470A0A470Bhtml/story01.htm | 13 | 5.59 |
http://rss.feedsportal.com/c/860/f/517321/s/b665eeb/l/0Lar0Brian0Bru0Canalytics0Carticles0C20A10A0A6250C1268429790Bhtml/story01.htm | 12 | 5.62 |
http://ar.rian.ru/analytics/articles/20100512/126263052.html | 13 | 5.66 |
http://rss.feedsportal.com/c/860/f/517321/s/d4147a7/l/0Lar0Brian0Bru0Cpolicy0Cforeign0C20A10A0A830A0C12760A84410Bhtml/story01.htm | 13 | 5.67 |
http://rss.feedsportal.com/c/860/f/517321/s/d3f95a3/l/0Lar0Brian0Bru0Ceconomy0Cfinances0C20A10A0A830A0C12760A64260Bhtml/story01.htm | 14 | 5.68 |
http://rss.feedsportal.com/c/860/f/517321/s/cbfa79f/l/0Lar0Brian0Bru0Csociety0C20A10A0A8120C1274442750Bhtml/story01.htm | 22 | 5.69 |
http://rss.feedsportal.com/c/860/f/517321/s/cbfa8a7/l/0Lar0Brian0Bru0Canalytics0Carticles0C20A10A0A8120C1274443780Bhtml/story01.htm | 15 | 5.69 |
http://rss.feedsportal.com/c/860/f/517321/s/c5237a9/l/0Lar0Brian0Bru0Canalytics0Carticles0C20A10A0A7280C12727180A60Bhtml/story01.htm | 13 | 5.69 |
http://rss.feedsportal.com/c/860/f/517321/s/b6f4f94/l/0Lar0Brian0Bru0Canalytics0Carticles0C20A10A0A6280C1268715220Bhtml/story01.htm | 14 | 5.70 |
http://rss.feedsportal.com/c/860/f/517321/s/b7f1f5d/l/0Lar0Brian0Bru0Crus0C20A10A0A70A10C1269317240Bhtml/story01.htm | 11 | 5.71 |
http://rss.feedsportal.com/c/860/f/517321/s/be498ee/l/0Lar0Brian0Bru0Canalytics0Carticles0C20A10A0A7130C1270A837330Bhtml/story01.htm | 13 | 5.72 |
http://rss.feedsportal.com/c/860/f/517321/s/becbd2e/l/0Lar0Brian0Bru0Canalytics0Carticles0C20A10A0A7140C1270A932810Bhtml/story01.htm | 17 | 5.72 |
http://www.alriyadh.com/2010/08/26/article554865.html | 19 | 5.72 |
http://ar.rian.ru/analytics/articles/20100510/126214959.html | 11 | 5.73 |
http://rss.feedsportal.com/c/860/f/517321/s/d0b01e0/l/0Lar0Brian0Bru0Cdisasters0C20A10A0A8230C12754790A90Bhtml/story01.htm | 11 | 5.74 |
http://rss.feedsportal.com/c/860/f/517321/s/d1bdfa2/l/0Lar0Brian0Bru0Cpolicy0Cforeign0C20A10A0A8250C1275698510Bhtml/story01.htm | 12 | 5.74 |
http://rss.feedsportal.com/c/860/f/517321/s/d7293c8/l/0Lar0Brian0Bru0Cpolicy0Carabic0Iaffairs0C20A10A0A90A50C1276675180Bhtml/story01.htm | 16 | 5.74 |
http://www.bbc.co.uk/go/wsy/syn/rss/2.0/-/arabic/business/2010/09/100916_usa_poverty.shtml | 24 | 5.74 |
http://rss.feedsportal.com/c/860/f/517321/s/c77d19a/l/0Lar0Brian0Bru0Cpolicy0Cforeign0C20A10A0A80A20C1273384280Bhtml/story01.htm | 16 | 5.75 |
http://rss.feedsportal.com/c/860/f/517321/s/b74798e/l/0Lar0Brian0Bru0Canalytics0Carticles0C20A10A0A6290C1268976820Bhtml/story01.htm | 13 | 5.76 |
http://rss.feedsportal.com/c/860/f/517321/s/bf531f7/l/0Lar0Brian0Bru0Canalytics0Carticles0C20A10A0A7150C12710A550A40Bhtml/story01.htm | 17 | 5.76 |
http://rss.feedsportal.com/c/860/f/517321/s/c6c3cdb/l/0Lar0Brian0Bru0Ceconomy0C20A10A0A7310C1273217840Bhtml/story01.htm | 14 | 5.76 |
http://ar.rian.ru/analytics/articles/20100518/126334083.html | 16 | 5.77 |
http://ar.rian.ru/rus/20100515/126315543.html | 14 | 5.77 |
http://www.alriyadh.com/2010/08/17/article552292.html | 29 | 5.77 |
Source | # of sentences | Average logarithmic rank |
---|---|---|
http://www.alriyadh.com/2010/08/03/article548813.html | 13 | 9.11 |
http://www.alriyadh.com/2010/08/03/article548798.html | 16 | 9.09 |
http://www.csc-sy.net/node/11966 | 36 | 9.09 |
http://www.alriyadh.com/2010/08/09/article550409.html | 15 | 8.99 |
http://www.alriyadh.com/2010/08/07/article549945.html | 32 | 8.92 |
http://www.alriyadh.com/2010/08/11/article550816.html | 16 | 8.92 |
http://www.alriyadh.com/2010/08/11/article550899.html | 20 | 8.83 |
http://www.alriyadh.com/2010/08/25/article554461.html | 49 | 8.80 |
http://www.alriyadh.com/2010/08/31/article556197.html | 11 | 8.77 |
http://www.alriyadh.com/2010/08/03/article548881.html | 28 | 8.74 |
http://www.alriyadh.com/2010/11/08/article575378.html | 14 | 8.68 |
http://www.alriyadh.com/2010/08/22/article553749.html | 20 | 8.65 |
http://www.alriyadh.com/2010/08/22/article553598.html | 28 | 8.63 |
http://www.elaph.com/Web/ElaphFashion/2009/5/436994.htm | 109 | 8.61 |
http://www.alriyadh.com/2010/08/29/article555556.html | 17 | 8.61 |
http://www.alriyadh.com/2010/08/17/article552359.html | 14 | 8.59 |
http://www.alriyadh.com/2010/08/09/article550460.html | 24 | 8.58 |
http://www.alriyadh.com/2010/08/26/article554910.html | 24 | 8.58 |
http://www.alriyadh.com/2010/05/21/article527773.html | 11 | 8.58 |
http://www.alriyadh.com/2010/06/20/article536410.html | 11 | 8.57 |
http://www.alriyadh.com/2010/09/02/article556603.html | 35 | 8.57 |
http://www.alriyadh.com/2010/08/14/article551589.html | 30 | 8.57 |
http://www.alriyadh.com/2010/08/10/article550693.html | 29 | 8.56 |
http://www.alriyadh.com/2010/09/06/article557473.html | 13 | 8.56 |
http://www.alriyadh.com/2010/08/02/article548553.html | 12 | 8.55 |
http://www.alriyadh.com/2010/08/25/article554486.html | 12 | 8.54 |
http://www.alriyadh.com/2010/08/10/article550741.html | 18 | 8.54 |
http://www.alriyadh.com/2010/08/24/article554358.html | 12 | 8.53 |
http://www.alriyadh.com/2010/08/27/article555053.html | 26 | 8.53 |
http://www.alriyadh.com/2010/09/07/article557857.html | 16 | 8.52 |
In this subsection we replace average word length by average logarithmic word rank. The logarithm of the word rank is taken because we want to punish words of high ranks only moderately.
First table:
select source, count(distinct i_s.s_id) as cnt_s, round(avg(log(w.w_id-100)),2) as av from sources so, inv_so i_s, inv_w i, words w where so.so_id=i_s.so_id and i_s.s_id=i.s_id and i.w_id=w.w_id and w.w_id>100 group by source having cnt_s>10 order by av LIMIT 30;
6.4.2.1 Average word length for different sources
6.4.2.3 Sources consisting of many / few words with frequency 1
6.4.2.4 Sources with low / high average word length of rare words